Cache Coherence in Intelligent Memory Systems
نویسندگان
چکیده
As microprocessor clock speeds continue to climb, it has become increasingly di cult to keep computations supplied with data from memory [P97] [GHI95] [MSM97]. One promising solution is to move some of the computation into memory. We focus upon Active Pages, a model of computation that can be implemented inexpensively in conventional DRAMs [OCS98] [OHK99]. Active Page memory systems are meant to accelerate computations on commodity workstations and PCs. An unanswered question with Active Pages, and intelligent memory proposals in general, is how to keep data coherent between cooperating processing elements. In the Active Pages programming model, this issue translates to coherence between the main processor caches and Active Page DRAM. Active Pages is a page-based model of computation which associates simple functions with each page of memory. For example, Java objects stored in memory may have functions bound to the Active Pages to perform pointer chasing for mark-and-sweep garbage collection. By partitioning computations between processor and Active Page memory, data intensive applications can execute up to 1000 times faster than on the same system using conventional memory. Previous implementations used software ushes to prevent cached data from becoming inconsistent with data modi ed by Active Pages in memory. The intelligent memory environment introduces several challenges in designing a coherent system. Because this technology is aimed at commodity desktop processors, even minor hardware changes become costly. The goal is to change as little as possible, incorporating the intelligent memory into existing processor coherence protocols. In addition, intelligent memory systems may have hundreds to thousands of computational units. This can become a burden to the communication network. The Active Pages system only allows sharing between computational units and the main processor, limiting the patterns of sharing in the system. This allows a basic communication network to sustain the coherence tra c from hundreds of processors. In this study, we analyze three methods of communication between the main processor and the Active Page memory system: explicit cache ushes, uncached variables, and hardware coherence mechanisms. We demonstrate that the limited sharing within the Active Page memory system makes hardware coherence mechanisms both e cient and easy to implement. For those who question even minimal modi cation to commodity memory systems, we also demonstrate that software cache management can perform comparably to hardware coherence. Ongoing work evaluates this tradeo between programmability and hardware modi cations.
منابع مشابه
Reduction in Cache Memory Power Consumption based on Replacement Quantity
Today power consumption is considered to be one of the important issues. Therefore, its reduction plays a considerable role in developing systems. Previous studies have shown that approximately 50% of total power consumption is used in cache memories. There is a direct relationship between power consumption and replacement quantity made in cache. The less the number of replacements is, the less...
متن کاملReduction in Cache Memory Power Consumption based on Replacement Quantity
Today power consumption is considered to be one of the important issues. Therefore, its reduction plays a considerable role in developing systems. Previous studies have shown that approximately 50% of total power consumption is used in cache memories. There is a direct relationship between power consumption and replacement quantity made in cache. The less the number of replacements is, the less...
متن کاملIntelligent Machinery: A Heretical View’
Scalable archetypes and RAID have garnered tremendous interest from both scholars and theorists in the last several years. After years of confusing research into courseware, we verify the deployment of the memory bus. We construct new wireless information (ROBING), disproving that cache coherence can be made robust, perfect, and relational.
متن کاملExploiting the Benefits of Multiple-Path Network DSM Systems: Architectural Alternatives and Performance Evaluation
| Modern high performance networks being used for scalable distributed shared memory (DSM) systems support multiple paths to increase bandwidth and/or reduce contention. Such networks violate the constraint of pairwise in-order message delivery implicitly required by many existing directory-based cache coherence protocols. To solve this problem, two alternative strategies are currently used by ...
متن کاملMESI Cache Coherence Simulator for Teaching Purposes
Nowadays, the computational systems (multi and uniprocessors) need to avoid the cache coherence problem. There are some techniques to solve this problem. The MESI cache coherence protocol is one of them. This paper presents a simulator of the MESI protocol which is used for teaching the cache memory coherence on the computer systems with hierarchical memory system and for explaining the process...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Computers
دوره 52 شماره
صفحات -
تاریخ انتشار 2003